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Background: Proton Magnetic Resonance Spectroscopy ( 1 H-MRS) is a non-invasive 
imaging technique that enables quantification of neurochemistry in vivo and thereby 
facilitates investigation of the biochemical underpinnings of human cognitive variability. 
Studies in the field of cognitive spectroscopy have commonly focused on relationships 
between measures of N-acetyl aspartate (NAA), a surrogate marker of neuronal health 
and function, and broad measures of cognitive performance, such as IQ. 

Methodology/Principal Findings: In this study, we used 1 H-MRS to interrogate 
single-voxels in occipitoparietal and frontal cortex, in parallel with assessments of 
psychometric intelligence, in a sample of 40 healthy adult participants. We found 
correlations between NAA and IQ that were within the range reported in previous studies. 
However, the magnitude of these effects was significantly modulated by the stringency of 
data screening and the extent to which outlying values contributed to statistical analyses. 

Conclusions/Significance: 1 H-MRS offers a sensitive tool for assessing neurochemistry 
non-invasively, yet the relationships between brain metabolites and broad aspects of 
human behavior such as IQ are subtle. We highlight the need to develop an increasingly 
rigorous analytical and interpretive framework for collecting and reporting data obtained 
from cognitive spectroscopy studies of this kind. 



Keywords: proton magnetic resonance spectroscopy, N-acetyl aspartate, cognition, IQ, processing speed, 
cognitive spectroscopy 



INTRODUCTION 

Parallel refinements in neuropsychological assessment and neu- 
roimaging techniques now enable the neurophysiological basis of 
individual variation in cognitive ability to be probed with increas- 
ing methodological precision (Jung and Haier, 2007; Haier, 2009). 
Proton magnetic resonance spectroscopy ('H-MRS) provides 
non-invasive quantification of neurochemicals and their metabo- 
lites in pre-defined regions of tissue, with a typical spatial res- 
olution on the order of cubic centimeters. Voxel size is limited 
by a trade-off between the signal-to-noise ratio (SNR) and tissue 
specificity. With smaller voxel sizes, specificity of voxel localiza- 
tion, and therefore tissue type, is increased, but SNR is reduced 
(Freeman, 2003). 

The use of 1 H-MRS as a prognostic and diagnostic tool is well 
established (Danielsen and Ross, 1999; Burlina et al., 2000; Tran 
et al, 2009; Lee et al, 2012). Changes in the 'H-MRS spectrum 
reflect general pathology, such as demyelination or ischemia, and 
is associated with an array of neurological conditions, such as 
brain tumors (Hollingworth et al, 2006; Hou and Hu, 2009) 
multiple sclerosis (Gonzalez-Toledo et al., 2006; de Stefano and 
Filippi, 2007), traumatic brain injury (Marino et al., 2011), and 
neurodegenerative diseases such as Alzheimer's (Kantarci, 2007; 
Loos et al., 2010) and Parkinson's disease (Firbank et al., 2002; 
Rango et al, 2007). For some conditions, 'H-MRS has been used 
to identify a specific biomarker that aids diagnosis. For example, 



in Canavan's disease, a degenerative condition characterized by a 
progressive loss of myelin, there is a specific, robust increase in the 
N-acetyl aspartate (NAA) peak, reflective of the increased volume 
of this metabolite (Matalon et al, 1988; Namboodiri et al, 2006). 

NAA is located primarily in pyramidal neurons, dendrites and 
axons, where it is involved in myelin lipid metabolism, neuronal 
osmoregulation and axon-glial signaling (Moffett et al., 2007). 
NAA is the most prominent and stable signal obtained with 1 LI- 
MRS in the human brain and provides a surrogate marker for 
the health and viability of neural tissue (Barker, 2001; Moffett 
et al., 2007), with concentrations in white matter reflecting the 
metabolic function of axons as well as the extent and efficiency of 
their myelination (Bjartmar et al, 2002). NAA may also enhance 
mitochondrial energy production from glutamate (Moffett et al., 
2007) and function as a molecular water pump (Baslow, 2002; cf. 
Moffett et al., 2007), thereby increasing the speed and efficiency 
of neuronal signaling. 

The putatively important role of NAA in neural tissue has 
been extended to investigations at more macroscopic levels of 
analysis, for example toward developing an understanding of the 
potential biochemical underpinnings of human cognitive abil- 
ity. One general approach has been to measure NAA in cortical 
tissue in conjunction with psychometric measures of cognitive 
skills such as those provided by standardized IQ tests. In young, 
healthy adults, NAA in occipitoparietal white matter correlates to 
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moderate effect with timed measures of neuropsychological per- 
formance, but less so with metrics of cognitive ability derived 
from non-timed tests (Jung et al, 1999a,b). NAA has also been 
demonstrated to be a predictor of moderate effect for Full-scale 
IQ (FSIQ), a construct derived from both timed and non-timed 
psychometric subscales (Jung et al., 2005). 

Correlations between NAA and IQ variables vary not only 
with the IQ subscale assessed, however, but also with the cor- 
tical region and tissue type interrogated by the MRS voxel. For 
example, whilst small, positive correlations have been demon- 
strated between NAA concentration in anterior gray matter and 
verbal IQ, small, negative correlations have also been reported 
between NAA in posterior gray matter and measures of perfor- 
mance IQ (Jung et al., 2009). These patterns of result obtained 
with healthy adults suggest that NAA measured across different 
cortical regions co-varies with independent constructs of cogni- 
tive ability and with tissue type. This general pattern of effect has 
been observed in children (Yeo et al., 2000; Ozturk et al., 2009), 
adolescents (Gimenez et al., 2004; Aydin et al., 2012) and in older 
populations (Valenzuela et al., 2000; Ferguson et al., 2002; Ross 
et al., 2005; Charlton et al, 2007; Kochunov et al, 2010). In addi- 
tion to these regionally specific effects, recent reports propose that 
whole brain NAA volumes may be positively correlated with both 
composite IQ measures and educational achievement in healthy 
adults (Glodzik et al., 2012), suggesting that both genetic and 
environmental variables mediate the relationship between NAA 
and cognitive variables. 

Table 1 summarizes the results from published studies report- 
ing the relationship between NAA and IQ variability. Of partic- 
ular note is the substantial variability in the magnitude of the 
correlations between 'H-MRS-detectable NAA and targeted IQ 
constructs. These studies are also characterized by methodolog- 
ical differences in spectroscopic protocols, the neuroanatomical 
location of voxel placement, the choice of neuropsychological 
constructs tested, the population sampled and the data screening 
and statistical analysis strategies employed. 

Coupled with the wide range of effect sizes reported for the 
sample of studies in Table 1, the substantial methodological vari- 
ability between studies poses significant challenges for drawing 
strong inferences about the strength of the relationship between 
neurometabolites obtained with ^-MRS and IQ variables. This 
difficulty is compounded by an apparent multiple comparisons 
problem that arises when studies have measured neurometabo- 
lites over multiple brain regions, and alongside a battery of intel- 
ligence measures, without detailing specific planned statistical 
comparisons a priori. Such studies may be particularly susceptible 
to inflated Type 1 error, resulting from the potentially large num- 
ber of statistical tests generated from crossing multiple levels of 
different independent variables. 

The aim of the current study was to assess the strength and 
significance of the relationship between ^-MRS-detectable NAA 
volumes and cognitive ability, and specifically the different rela- 
tionships proposed between timed and untimed IQ subscales 
(Jung et al, 1999a,b). The NAA resonance within white mat- 
ter likely reflects both the metabolic function of the neuronal 
axons as well as the extent and efficiency of their myelina- 
tion. Correspondingly, given NAAs putative regulatory role in 



neurophysiological processing speed — which may be a function 
of its important regulatory role within myelin lipid synthesis — 
we predicted strong associations between volumes of NAA in 
frontal (FRO) white matter and the Information Processing Speed 
(IPS) index from the Wechsler Adult Intelligence Scales (WAIS; 
Wechsler, 1997b). Furthermore, we predicted that the correla- 
tion between NAA and IPS would be stronger than that found 
for general cognitive ability [indexed by the WASI full-scale IQ 
(FSIQ)]. As a control condition, we measured NAA in voxels in 
occipitoparietal (OP) cortex, for which the same pattern of strong 
covariance with IQ measures was not predicted. 

MATERIALS AND METHODS 
ETHICS STATEMENT 

Written, informed consent was obtained from all participants 
under a protocol consistent with the tenets of the Declaration of 
Helsinki and with the approval of Aston University's Institutional 
Review Board and the Black Country Research Ethics Committee 
in the United Kingdom (08/H1202/38). 

PARTICIPANTS 

We recruited 44 healthy volunteers (29 females) from the local 
population and from the university student body (mean age: 21.1, 
SD = 3.5). Participants were screened prior to testing to exclude 
the presence of probable neurological dysfunction, including pre- 
vious serious brain injury, history of learning disability, neurolog- 
ical disease, psychiatric diagnosis or current use of psychoactive 
medication. The data from four participants were excluded at the 
outset due to poor-quality spectra resulting from susceptibility 
or motion artifact. Forty participants (27 females) were therefore 
included in the final analyses. The sample provides the study with 
statistical power in excess of 70% to detect moderate correlations 
of 0.4 and above, with statistical significance evaluated at an alpha 
level of 0.05 (Friedman, 1968). 

NEUROPSYCHOLOGICAL ASSESSMENT 

The neuropsychological test battery comprised standardized 
indices of cognitive ability including IPS (Digit-Symbol Copy and 
Symbol Search from the WAIS) and FSIQ from the Wechsler 
Abbreviated Scales of Intelligence (WASI) (Wechsler, 1997a). All 
neuropsychological tests were completed in a single session and 
within 1 week of the neuroimaging protocol. 

NEUR0IMAGING 

The 'H-MRS data were obtained with a Siemens 3T Trio scan- 
ner (Siemens Medical Solutions, Berkshire, UK) using standard 
acquisition software and a quadrature head coil. Single-voxel 1 IT- 
MRS was performed following localization of the volume (8 cm 3 ) 
of interest using a 5 -plane localizer (TR = 20 ms, TE = 5 ms, 10 
slices at 5 mm thickness). For the left frontal (FRO) VOI, the voxel 
was positioned in the anterior cerebrum, centered superior to the 
lateral ventricles and avoiding overlap with non-neural sources 
and the corpus callosum. For the left occipitoparietal (OP) VOI, 
the voxel was placed in a region encompassing both occipital cor- 
tex and the inferior parietal lobule, with placement positioned to 
avoid overlap with the cerebellum (see Figure 1A). Manual voxel 
positioning maximized the contribution from white matter and 
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Table 1 | Summary of published investigations of the relationships between N-acetyl aspartate (NAA) and constructs of cognitive abilities in 
healthy samples. 



Study 



Age in 
years (SO) 



Region(s) 



Method 



Key result 



Correlation 
coefficient (r) 



Aydin et al., 
2012 



30 



15.1 (0.75) 



Genu, midbody and 
isthmus/splenium of 
corpus callosum 



STEAM; Absolute 
concentrations 



NAA in isthmus/splenium 
region positively 
correlated with PIQ a and 
FSIQ b 



0.57 a (p= 0.001), 
0.55 b (p = 0.002) 



Kochunov 38 67.9(9.1) Left/right FRO WM PRESS; Absolute 

et al., 2010 (forceps minor area) concentrations 



NAA correlated with 
"psychomotor" cognitive 
processing speed 



0.22 (p < 0.05) 



Jung et al., 63 23.7 (4.2) Left/right, PRESS; Absolute 

2009 posterior/anterior G/WM concentrations 



Lower right anterior GM 
NAA predicted VI Q c . 
Higher posterior GM 
NAA predicted PIQ d 



0.01 c (p= 0.011), 
0.01 d (p= 0.006) 



Ozturketal., 51 12.3(3.6) Frontal WM, DLPFC, 

2009 parietal WM, inferior 

parietal cortex, dorsal 
parietal cortex 



Multisection spin-echo; 
Ratio to Cr 



Left FRO WM NAA/Cr 
correlated with Purdue 
Pegboard right-hand raw 
scores e . Right FRO WM 
NAA/Cr correlated with 
SB-IV'Bead Memory' raw 



0.01 e (p = 0.047), 
0.01 f (p = 0.032) 



Charlton 78 
et al., 2007 



Jung et al., 27 
2005 



58.2 Centrum semiovale WM 



24.8(5.9) Left OR left FRO and 
right FRO WM 



CSI; Absolute 
concentrations 



PRESS; Absolute 
concentrations 



NAA positively correlated 
with general composite 
measure of cognitive 
performance 

Combination of higher 
left occipital WM and 
lower FRO WM NAA 
correlated with FSIQ in 
women 



0.33 ¥ (p < 0.01) 



0.67 (p < 0.001) 



Ross et al. 
2005 



59 



70.8 Left frontal WM, midline 

OP GM 



STEAM; Ratio to Cr and 
Ratio to water 



Frontal WM NAA/H 2 0 
correlated with a 
composite measure 
representing speed of 
information processing, 
attentional function and 
visual memory 



0.32 (p = 0.032) 



Gimenez 21 14(2.3) Left medial temporal 

et al., 2004 cortex 



PRESS; Ratio to Cho 



NAA/Cho related to free 
recall 8 and recognition h 
memory abilities 



0.569 (p= 0.008), 
0.51 h (p = 0.018) 



Pfleiderer 
etal., 2004 



62 



38.5(15.4) DLPFC, left ACC 



STEAM; Absolute 
concentrations 



NAA positively correlated 
with Verbal Intelligence in 
women in left DLPFC and 
left ACC 



0.53 (p = 0.009) 



Ferguson 
et al., 2002 



65-70 Left parietal WM 



PRESS; Ratios to Cr, 
Ratios to Cho, "adjusted 
metabolite values" 



NAA/Cr correlated with 
Logical Memory, but 
effect due to Cr (Cr 
negatively correlated 
with Logical Memory) 



0.24 (p < 0.05) 



(Continued) 
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Table 1 | Continued 



Study 


n 


Age in 
years (SD) 


Region(s) 


Method 


Key result 


Correlation 
coefficient (r) 


Valenzuela 
et al., 2000 


20 


72 


Left OP and left FRO and 
WM 


STEAM; Ratios to Cr 


Frontal WM NAA/Cr 
correlated with 
executive-attentional 
cognitive ability 


0.61 (p < 0.05) 


Yeo et al., 
2000 


20 


9.48 (1.67) 


Right FRO WM 


STEAM; Absolute 
concentrations 


NAA positively correlated 
with Working Memory 
(Visual two-back test) 


0.45 (p = 0.04) 


Jung et al., 
1999a 


26 


22 (4.6) 


Left OP WM 


STEAM; Absolute 
concentrations 


NAA positively correlated 
with FSIQ' (p < 0.001) 
and PIQi 


0.52' (p < 0.001), 
0.4& (p < 0.001) 


Jung et al., 
1999b 


45+ 


23 (4.9) 


Left OP WM 


STEAM; Absolute 
concentrations 


NAA correlated with 
timed tasks k to greater 
degree than general IQ' 


0.66 k (p< 0.001), 
0.59' (p < 0.001) 



The principal findings from each study are summarized, along with key study characteristics and the associated correlation coefficients (r). The MRS acquisition 
method for each study is presented (STEAM, stimulated echo acquisition mode; PRESS, point-resolved spectroscopy) and whether absolute concentrations or 
relative (ratio) quantification was used. ACC, anterior cingulate cortex; DLPFC, dorsolateral prefrontal cortex; FRO, frontal; GM, gray matter; OP occipitoparietal; 
WM, white matter. 

f 26 participants' data previously published in Jung et al. (1999a). 
¥ beta-weight. 




FIGURE 1 | (A) Approximate localization of ^-MRS voxels in frontal 
(FRO) and occipitoparietal (OP) cortex shown in the sagittal and 
axial planes with a 5-plane localizer {TR = 20 ms, TE = 5 ms, 10 



slices at 5 mm thickness). (B) Representative 'H-MRS spectrum 
from a FRO voxel; Original data shown with the LModeled data 
overlaid (heavy line). 



minimized the contribution from gray matter. Automated shim- 
ming was followed by a STEAM pulse sequence ( TR = 2000 ms, 
TE = 30 ms, 96 averages), including water suppression, for both 
frontal and occipitoparietal voxel acquisitions. 

1 H-MRS DATA SCREENING 

The data were screened in a two-stage process to ensure their 
reliability and validity prior to statistical analyses. In the first 
stage, spectra were visually inspected. High-quality spectra were 
identified by narrow, well-resolved peaks, a reasonably flat base- 
line and the presence of Hunter's Angle, a 45° angle connecting 
the major ^-MRS peaks in normal tissue. Poor quality spec- 
tra were identified by distorted baselines and broadened spectral 



line widths (increased full-width, half maximum peak height). 
Online data screening allowed reacquisition of data if necessary, 
following adjustment of voxel positioning, and therefore maxi- 
mized the probability of acquiring high-quality data from every 
participant. 

To validate data quality, assess reliability, and quantify error 
variance introduced by scanner-related factors such as that caused 
by field inhomogenieties, data from two successive scans of 
the same voxel were obtained and averaged for each partic- 
ipant. Intraclass correlation coefficients across these two esti- 
mates were 0.947 and 0.910 for the FRO and OP voxels, 
respectively, demonstrating a high degree of test-retest reliability. 
The coefficient of variance within each voxel (9% in FRO and 
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8% in OP) was consistent with spectroscopic data published 
previously (Ross et al., 2005). 

All data were analyzed with LCModel (Provencher, 2001), 
a software package that provides spectral quantification and 
metabolite detection optimized for short echo time spectroscopy. 
A representative 'H-MRS spectrum acquired from FRO voxels, 
showing original data with the modeled data (using LCModel) 
overlaid, is shown in Figure IB. 

In the second stage of screening, data were excluded if 
metabolite values had a %SD (the estimated standard devia- 
tion, expressed as a percentage of the estimated concentration) 
exceeding 20%, the approximate criterion for acceptable reliabil- 
ity suggested by Provencher (2008). Three data sets were excluded 
by this criterion. Figure 2 shows the LCModel spectra for the 
maximum and minimum NAA/Cr values for FRO and OP vox- 
els of the unscreened data. These data fulfilled the initial basic 
screening criteria for reliability and validity, as described above, 
but were subsequently excluded from data analyses as statistical 
outliers. 

RESULTS 

All analyses reported here adopt the standard convention of 
expressing NAA as a ratio to Cr (NAA/Cr). A summary of the 
descriptive data for the spectroscopic and psychometric measures 
is provided in Table 2. Mean metabolite ratios to Cr were con- 
sistent with those reported previously (Valenzuela et al., 2000; 
Ozturk et al., 2009). FSIQ scores were available for 31 participants 
and IPS scores for all 40 participants. MRS data for OP cortex was 
obtained for all 40 participants and for 36 participants for FRO 
volumes. 



No significant differences in NAA/Cr were observed between 
FRO and OP voxels overall [t (35) = 1.86, p = 0.07]. Similarly, 
there were no main effects of sex on NAA/Cr in either region [OP: 
f (36 ) = 2.78, p = 0.94; FRO: f( 34 ) = 0.02, p = 0.34]. 

Table 3 presents bivariate correlations between the spectro- 
scopic and IQ variables. Here the data are presented in two 
complementary ways. First, the values obtained from all partic- 
ipants are included in the analyses. Figure 2 shows the LCModel 
spectra of the minimum and maximum NAA/Cr values in FRO 
and OP voxels for this unscreened data (see Table 2). Second, 
three univariate outliers, whose NAA/Cr value for either the FRO 
or OP voxel was 2SD or more away from the overall group mean, 
were removed from the analysis. 

The unscreened data did not consistently satisfy distribu- 
tional assumptions of normality (Shapiro-Wilk W >0.05) and 
were therefore analyzed using non-parametric tests of associa- 
tion. In contrast, the distribution of the screened data did not 



Table 2 | Descriptive statistics summarizing cognitive and ^-MRS 
data for the sample. 





Mean (SD) 


Min.-Max. 


Age (years) 


21.2 (3.5) 


18-34 


FSIQ (SS)* 


112.7 (12.3) 


85-153 


IPS (SS) 


108.4 (13.0) 


88-140 


OP NAA/Cr 


1.61 (0.21) 


0.78-1.88 


FRO NAA/Cr 


1.56 (0.25) 


0.42-1.95 



SS, scaled score (mean = 100; SD = 15); FSIQ, Full-scale IQ, IPS, Information 
Processing Speed; OP occipitoparietal; FRO, frontal; *n= 31 for this variable. 
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FIGURE 2 | LCModel spectra of unscreened data showing minimum and (B) NAA/Cr concentrations in FRO voxels, the bottom two panels show the 
maximum values for NAA in frontal (FRO) and occipitoparietal (OP) spectra for minimum (C) and maximum (D) NAA/Cr concentrations in OP 

voxels. The top two panels show the spectra for minimum (A) and maximum voxels. 
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deviate significantly from normality and were analyzed using 
parametric correlations. The relationships between IPS or FSIQ 
constructs and NAA/Cr were neither strong, nor statistically sig- 
nificant, a pattern that was consistent across both OP and FRO 
voxels and both parametric and non-parametric analyses (see 
Table 3). 

DISCUSSION 

Changes in neurometabolite concentrations, particularly for 
NAA, have been linked to psychometric and performance 
variables across a range of disorders of cognitive function, includ- 
ing acute traumatic brain injury, and more chronic conditions 
such as schizophrenia and Alzheimer's disease (Danielsen and 
Ross, 1999; Burlina et al, 2000; Ross and Sachdev, 2004; Tran 
et al., 2009; Lee et al, 2012). This evidence has motivated inves- 
tigations of normal populations to assess the ability of cognitive 
spectroscopy paradigms to identify potential biomarkers for vari- 
ability in human intellectual function and achievement. Several 
studies have reported statistically significant correlations between 
NAA volumes and broad measures of cognitive skill, with small 
to moderate correlations typically found in samples of putatively 
normal populations (see Table 1). 

In our study, we found that quantitative estimates of NAA in 
cortex did not correlate strongly or significantly with standardized 
measures of IQ constructs. The small correlations we obtained 
between NAA concentration and measures of cognitive function 
echo other findings that have reported weak associations between 
IQ and neurometabolite variables in healthy cohorts (Friedman 
et al., 1999; Shim et al, 2001; Filippi et al, 2002; Gimenez et al., 

2004) . However, they also contrast with other demonstrations 
of comparatively higher correlations between these constructs 
obtained with similar research designs (Jung et al, 1999a,b, 

2005) . 

Based on the principal findings of the 14 studies included in 
Table 1, the average correlation between NAA and various mea- 
sures of cognitive ability is 0.39 with a standard deviation of 0.23. 



Table 3 | Correlations between NAA/Cr and psychometric measures 
of cognitive function in frontal (FRO) and occipitoparietal (OP) voxels. 



Parametric Non-parametric 







FSIQ 


IPS 


FSIQ 


IPS 


Unscreened FRO 


n 


31 


40 


31 


40 




NAA/Cr 


0.13 


-0.19 


-0.01 


-0.25 


OP 


n 


31 


40 


31 


40 




NAA/Cr 


0.03 


0.07 


-0.06 


-0.12 


Screened FRO 


n 


28 


37 


28 


37 




NAA/Cr 


-0.04 


-0.29 


-0.1 


-0.26 


OP 


n 


28 


37 


28 


37 




NAA/Cr 


-0.12 


-0.26 


0.05 


-0.25 



Abbreviations as for Table 2. Screened and unscreened data analyzed with 
non-parametric (Spearman's Rho) and parametric (Pearson Product Moment! 
correlations for comparison. 3 outlier values (> ±2SD from cohort mean) 
excluded. Max r= -0.29 (p= 0.078) for Pearson correlation between FRO NAA 
and IPS. 



The large standard deviation relative to the mean suggest the 
presence of inhomogeneous effects found across the population 
of published studies in this area. We suggest that this variability 
resulted at least in part from research design and methodological 
features that differed across studies. We therefore interpret the 
results of the present investigation within the context of these 
inter-study differences. 

RESEARCH DESIGN ISSUES 
Age 

Previous studies employing cognitive spectroscopy of healthy 
populations have varied substantially in the age of the popula- 
tion sampled (see Table 1). Although we observed no significant 
correlations between NAA/Cr and age for either FRO or OP 
voxels in our study, the mean age and standard deviation of 
the participants (see Table 2) was similar to that obtained from 
other investigations of young, healthy adults, some of which have 
reported strong and significant IQ/neurometabolite correlations 
(Jung et al., 1999a,b, 2005). Whilst previous reports suggest the 
presence of age-related neurometabolic changes across the lifes- 
pan (Pouwels et al, 1999; Angelie et al., 2001), there appears to 
be no strong systematic relationship between the age of the study 
sampled and the strength of the IQ/neurometabolite correlations 
(r = 0.15,p = 0.55; see Table 1). Our interpretation of these data 
is that age of the cohort does not account for substantial vari- 
ability in the strength of correlations between NAA and IQ in the 
cohort of previously published studies. 

Sex 

In our data set, we observed no main effect of sex on NAA/Cr 
values (echoing the findings of Charles et al., 1994; Pouwels and 
Frahm, 1998; Komoroski et al., 1999; Gimenez et al, 2004; Safriel 
et al., 2005), nor did we observe sex differences in the strength of 
correlations between NAA and cognitive measures in either FRO 
or OP regions. Few studies have reported sex differences in the 
relationship between NAA and cognitive ability. Two of the 14 
studies in included in Table 1 (Ferguson et al., 2002; Aydin et al., 
2012) studied only males. Pfleiderer et al. (2004) observed that 
NAA in the DLPFC (n = 52; 58% males) and in the left anterior 
cingulate cortex (« = 62; 65% males) was uniquely positively cor- 
related with vocabulary scores (r = 0.53 and 0.56, respectively). 
Jung et al. (2005) observed that occipitoparietal NAA strongly 
correlated with FSIQ scores in women (adjusted r 2 = 0.82), but 
not men (adjusted r 2 = 0.35; n = 27; 63% male). When groups 
divided by sex were reasonably well-matched (n = 63; 54% male), 
Jung et al. (2009) found similar correlations for male and female 
participants (r = 0.34 and r = 0.24), specifically between right 
posterior gray matter NAA and Performance IQ. Jung et al. 
acknowledged that their study was underpowered for analysing 
potential sex differences in the context of other potentially sig- 
nificant factors such as the region and tissue type examined. 
The same limitation applies to our study, in which females were 
overrepresented. 

Region 

We observed no significant differences in NAA/Cr across corti- 
cal regions in our analyses, a result that is in accordance with 
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some studies (for example, Ozturk et al., 2009), but which con- 
trasts with others that have observed significant inter-regional 
differences using both single (Minati et al., 2010) and simulta- 
neous acquisitions of multiple MRS voxels (Angelie et al, 2001; 
Maudsley et al, 2009). While the NAA/Cr ratios, both for our 
study and for those previously published (see Table 1) are consis- 
tent with normative data (Danielsen and Ross, 1999), it would not 
be surprising if the correlations between neurometabolites and 
cognitive function varied according to the cortical region inves- 
tigated, based on knowledge of the functional specialization of 
the cerebral cortex. It was for this reason that we obtained mea- 
sures from FRO voxels, given the importance of this region for the 
cognitive functions assessed with the WAIS measures (Baddeley, 
1996; Duncan et al., 2000), and a control volume in OP cor- 
tex, for which we would not predict the same magnitude of 
relationship. 

Tissue type 

The typically large spatial extent of the voxel from which data 
is acquired in MRS studies makes it difficult to obtain measures 
from homogeneous tissue. However, decreasing the spatial extent 
of the size of the voxel leads to a corresponding decrease in the 
signal to noise ratio (Freeman, 2003). The extent to which MRS 
samples gray or white matter can modulate both the metabo- 
lite values obtained (Wang and Li, 1998; McLean et al„ 2000; 
Wiedermann et al., 2001) and the pattern of NAA/IQ relation- 
ships (Jung et al., 2009). The precision gained from obtaining 
measures from homogeneous tissue, or after correcting for tissue 
inhomogeneity, is a reasonable aspiration for future studies. Of 
the studies included in Table 1, Charlton et al. (2007) is the excep- 
tion in that they excluded voxels containing less than 75% white 
matter. They acknowledged, however, the inevitable reduction in 
sample size (and correspondingly in nominal statistical power) 
that potentially results from applying such post-hoc screening 
criteria in research studies. 

METHODOLOGICAL ISSUES 
Assessments of cognitive ability 

Previous studies have typically adopted one of three general 
strategies to assess cognitive ability psychometrically: (1) broad, 
non-specific measures of cognitive ability, such as those obtained 
by standardized assessments of general intelligence [e.g., FSIQ, 
Verbal IQ and Performance IQ from standardized test batter- 
ies (Wechsler, 1997a,b)]; (2) data reduction techniques to derive 
composite IQ scores for use as dependent variables (Valenzuela 
et al., 2000; Ross et al, 2005; Kochunov et al, 2010; Glodzik et al., 
2012); or (3) measuring more specific cognitive domains such as 
working memory (Yeo et al., 2000) or IPS (Jung et al, 1999b). 
For correlations with NAA, studies (« = 9) that used broad-based 
measures such as FSIQ or a composite score derived by factor 
analysis have an average correlation coefficient (Spearman's Rho) 
of 0.44 (SD = 0.23). In contrast, those studies (n = 4) reporting 
findings for specific cognitive measures yield an average correla- 
tion of 0.30 (SD = 0.25) (see Table 1). A minority of studies have 
framed more specific hypotheses, for example, proposing sex dif- 
ferences in NAA and verbal processing ability (Pfleiderer et al., 
2004) or hemispheric specificity with respect to verbal working 



memory and motor speed (Ozturk et al., 2009). The associated 
correlation coefficients for these relationships is characteristically 
variable (r = 0.67 and 0.01, respectively). 

In our study, we opted for a hybrid approach, assessing both 
a broad measure of IQ and the IPS index, which taps a smaller 
set of abilities in mental and motor speed (Groth-Marnat et al., 
2000). Given NAAs putative regulatory role in neurophysiological 
processing speed (Jung et al., 1999a,b), we predicted, although did 
not find, strong positive associations between levels of NAA in 
frontal (FRO) cortex and WAIS IPS scores (Wechsler, 1997b) (see 
Table 3). 

The methodological approach to data analytic strategy should 
be directly informed by the hypotheses adopted a priori (or 
lack thereof), which then governs and constrains the choice and 
range of psychometric assessments applied and how they are 
treated in statistical analyses. Given the potentially diffuse role 
of NAA in brain function, broader-based measures of cognitive 
function may provide the most robust correlations because both 
MRS-detectable NAA and psychometric scores provide by proxy 
measurement of underlying and potentially correlated constructs. 

Metabolite quantification 

Published studies vary according to whether absolute concen- 
trations of NAA are used or whether NAA values are expressed 
as a ratio to another neurometabolite (typically Cr or Cho). 
Expressing NAA as a ratio to another metabolite confers the 
advantage of correcting for potentially important unknown or 
uncontrollable, yet correlated, experimental factors, such as static 
(B0) and radio frequency (RF, Bl) field inhomogeneity. Ratio 
values continue to be used, for example in studies of mild cog- 
nitive impairment (Nie et al, 2013; Targosz-Gajniak et al, 2013), 
where cognitive spectroscopy research is particularly productive. 
However, the validity of using such ratios may be compromised 
by the potentially under-conservative assumption of the stability 
of the reference metabolite (Li et al., 2003). For example, Cr val- 
ues may change not only in disease states (Maniega et al, 2008) 
but also during the course of normal aging (Haga et al, 2009). An 
alternative strategy is to use one of a number of absolute quan- 
tification methods (Jansen et al., 2006), and in particular using 
the water signal as reference (Keevil et al., 1998). However, this 
method is also subject to limitations. The water fractions within 
gray and white matter differ, which necessitates precise tissue par- 
titioning, a requirement most reliably obtained with voxels of 
small spatial extent and with post-acquisition verification of voxel 
positioning. 

Despite these potential issues, our analysis of the studies in 
Table 1 did not identify any systematic relationship between the 
method of metabolite quantification adopted and the size and 
strength of the reported findings between IQ and NAA [f(is) = 
0.968, p = 0.35 for between group effects using absolute or rel- 
ative quantification methods]. Whilst the quantification method 
does not seem to be a significant factor in this specific context, 
until a standard referencing procedure is widely adopted, com- 
paring data across studies in different domains that differ in this 
variable will remain difficult (de Beer et al., 1995; Knight-Scott 
et al, 2003). Furthermore, investigating multivariate relationships 
using multiple measures of metabolite expressions, such as with 
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both absolute values and ratios, may only compound difficul- 
ties in interpretation because this procedure increases greatly the 
number of potential statistical comparisons within a given study. 

Data screening and analysis 

The procedures used for data screening, coupled with the suit- 
ability of the statistical analysis strategy adopted, may critically 
underpin the validity of the findings gleaned from a given study. 
Differences between studies in Table 1 were also apparent in 
their analytical strategies, including procedures for data screen- 
ing to ensure robust correlational data (Tabachnik and Fidell, 
2001). Data validity depends on several factors, including the 
assumption of univariate and multivariate normality and how 
outlying values are handled in statistical analyses. To date, pub- 
lished investigations have employed highly variable procedures 
for data screening, with little homogeneity of practice in the 
methods of analysis applied, or in the presentation of data. 
Levels of screening range from general removal of poor qual- 
ity spectra (Jung et al., 1999b; Ross et al., 2005), to exclusion of 
metabolite values exceeding a critical statistical cut-off, either fol- 
lowing post-hoc quantification (Ferguson et al, 2002; Pfleiderer 
et al., 2004; Jung et al, 2005) or screening for voxel tissue 
composition, but not the metabolite values obtained (Charlton 
et al., 2007). Other studies (Jung et al., 1999a; Valenzuela et al., 
2000; Yeo et al, 2000; Gimenez et al, 2004; Kochunov et al, 
2010) have not reported details of the data screening procedures 
employed. 

All of the studies in Table 1 employed parametric data anal- 
yses. As demonstrated in the present study, distributional nor- 
mality of spectroscopy data cannot be assumed, yet previously 
published studies have not included information on the normal- 
ity or distributional assumptions of their data. Analysis of our 
unscreened data with parametric measures (which were not justi- 
fied in this case), results in significant correlations between NAA 
and IQ variables on the order of r = 0.40, a finding similar to 
those previously reported (i.e., the average correlation of the stud- 
ies in Table 1 is 0.39). Given that the potential impact of outliers 
and non-Gaussian distributions of data appear to modulate the 
pattern of results, we suggest that the extent and nature of data 
screening employed should be routinely reported in cognitive 
spectroscopy studies. 

Lack of robust data screening of relatively small samples, com- 
bined with the large potential number of statistical comparisons, 
and a lack of specific comparisons identified a priori, can result in 
the inflated probability of Type 1 error. Correspondingly, analyses 
of the studies published to date (Table 1) suggest that as sam- 
ple size increases, the strength of the correlation between NAA 
and IQ constructs tends to decrease (Spearman's Rho = -0.51; 
p = 0.025). 

A key future consideration for studies in cognitive spec- 
troscopy is to distinguish between statistically significant effects 
and ones that are statistically relevant, particularly for cases 
where effect sizes may not be large enough to survive statisti- 
cal adjustments for multiple comparisons or were not identified 
as planned comparisons a priori. Whilst our initial sample size 
of 44 is relatively modest, it falls well within the range of pub- 
lished studies and is sufficiently large enough to detect moderate 



correlations between NAA and cognitive ability with adequate 
statistical power. The results of this quantitative review sug- 
gest that many studies in this area have been underpowered 
statistically to reliably detect subtle relationships between neu- 
rometabolite volumes and constructs of human cognitive ability. 
The subtlety of the relationships motivates the collection of larger 
samples of participants than have been typically obtained to 
achieve the statistical power necessary to reliably detect multiple 
significant effects and their interactions (Jung et al., 2009; Ozturk 
et al, 2009). 

CONCLUSIONS 

^-MRS provides a valuable and sensitive tool for quantify- 
ing neurochemistry in vivo. In this study of healthy adults, we 
adopted strict screening criteria to ensure robust, reliable, single- 
voxel metabolite data and observed statistically weak relationships 
between frontal cortex NAA and FSIQ and IPS. Although not sig- 
nificant statistically, the effect sizes obtained fell in the range of 
those reported in previously published studies. Whilst our null 
findings may be the result of several factors related to methodol- 
ogy and data screening procedures, the lack of a standard practice 
the reporting of data in the literature suggests that conclusions 
drawn from these types of studies still need to be tempered 
by consideration of the methodological framework employed. 
Despite the limitations, there is the potential for ^-MRS to make 
a significant contribution toward bridging the gap from micro- 
to macroscopic levels of analysis that underpin brain-behavior 
relationships. 
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